首页> 外文OA文献 >PELESent: Cross-domain polarity classification using distant supervision
【2h】

PELESent: Cross-domain polarity classification using distant supervision

机译:pELEsent:使用远程监督进行跨域极性分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The enormous amount of texts published daily by Internet users has fosteredthe development of methods to analyze this content in several natural languageprocessing areas, such as sentiment analysis. The main goal of this task is toclassify the polarity of a message. Even though many approaches have beenproposed for sentiment analysis, some of the most successful ones rely on theavailability of large annotated corpus, which is an expensive andtime-consuming process. In recent years, distant supervision has been used toobtain larger datasets. So, inspired by these techniques, in this paper weextend such approaches to incorporate popular graphic symbols used inelectronic messages, the emojis, in order to create a large sentiment corpusfor Portuguese. Trained on almost one million tweets, several models weretested in both same domain and cross-domain corpora. Our methods obtained verycompetitive results in five annotated corpora from mixed domains (Twitter andproduct reviews), which proves the domain-independent property of suchapproach. In addition, our results suggest that the combination of emoticonsand emojis is able to properly capture the sentiment of a message.
机译:互联网用户每天发布的大量文本促进了在几种自然语言处理领域中分析此内容的方法的开发,例如情感分析。该任务的主要目标是对消息的极性进行分类。即使已经提出了许多用于情感分析的方法,但一些最成功的方法还是依赖于大型带注释的语料库,这是一个昂贵且耗时的过程。近年来,远程监督已用于获取更大的数据集。因此,受这些技术的启发,在本文中,我们扩展了这些方法,以合并在电子消息中使用的流行图形符号,即表情符号,以创建葡萄牙语的大型情感语料库。在将近一百万条推文上进行了训练,在相同域和跨域语料库中测试了几种模型。我们的方法在来自混合域的五个带注释的语料库(Twitter和产品评论)中获得了非常有竞争力的结果,证明了这种方法的域独立属性。此外,我们的结果表明,表情符号和表情符号的组合能够正确捕获消息的情感。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号